Polynomial quasi-harmonic models for speech analysis and synthesis
نویسندگان
چکیده
Harmonic plus noise models have been successfully applied to a broad range of speech processing applications, including, among others, low bit-rate speech coding, and speech restoration and transformation. In conventional methods, the frequencies, the relative phases and the amplitudes of the pitch-harmonic components are assumed to be piecewise constants over an analysis frame. This assumption is inadequate in segments where fast variations of these parameters may occur, e.g. phoneme-to-phoneme boundaries or speech onsets. In this contribution, a time-varying model of the pitch-harmonic parameter is presented. It is based on a basis expansion technique, consisting in representing the time-varying functions as a linear combination of fixed basis function. An estimation procedure for the parameters of this expansion is presented. Results are provided to demonstrate the effectiveness of this approach. 1. HARMONIC-PLUS-NOISE MODELS Sinusoidal as well as Harmonic-plus-noise models have been applied with success to solve many speech processing problems. Harmonic-plus-noise model (HNM) assumes the speech signal to be composed of a quasi-harmonic part s(t) and a noise part n(t): x(t) = s(t) + n(t) (1) The quasi-harmonic part, in voiced segments, reflects the (local) periodicity of the speech signal. The noise part n(t) accounts for the cycle-to-cycle variations of the glottal airflow, the friction noise, etc... Harmonic plus noise models have proven to be useful in many speech processing applications, including among others low-bit rate speech coding [6], speech transformation (time scaling, pitch scaling) [3], text-to-speech synthesis and co-channel speaker separation [7]. In a quasi-harmonic model, s(t) is represented as a superposition of almost harmonically related sinusoidal components
منابع مشابه
Analysis of Voiced Speech using Harmonic Models
It is very common to consider voiced speech as a periodic or quasi-periodic signal. Therefore, harmonic models are usually proposed for the modeling of this kind of speech. This paper focuses on the harmonic analysis of speech. Three models are presented and compared. The rst model is the simplest among the three models supposing that the speech signal is a stationary signal. The second model d...
متن کاملComparing the Order of a Polynomial Phase Model for the Synthesis of Quasi-harmonic Audio Signals
Sinusoidal modeling has been successfully applied to a wide range of audio signal processing problems, such as coding or time and frequency stretching. While many methods have been proposed for the analysis part of the process, it seems that there is some general agreement concerning the synthesis part in the nonoverlapping case: It is very often achieved by using the wellknown McAulay-Quatieri...
متن کاملOn the harmonic index and harmonic polynomial of Caterpillars with diameter four
The harmonic index H(G) , of a graph G is defined as the sum of weights 2/(deg(u)+deg(v)) of all edges in E(G), where deg (u) denotes the degree of a vertex u in V(G). In this paper we define the harmonic polynomial of G. We present explicit formula for the values of harmonic polynomial for several families of specific graphs and we find the lower and upper bound for harmonic index in Caterpill...
متن کاملStatistical text-to-speech synthesis with improved dynamics
•Apply a DFT of length dTi to the quasi-periodic sequence: –Harmonic frequencies: k = lTi, l = 1, 2, . . . , d –Non-harmonic frequencies: k + 1, k + 2, . . . , k + Ti − 1, where k = 1, Ti, 2Ti, . . . , (d − 1)Ti •Non-harmonic content (NHC) in statistically generated phonemes is much lower, compared to NHC in natural phonemes. Improving Speech Features Dynamics Learning Non-Harmonic Components S...
متن کاملAccurate estimation of sinusoidal parameters in an harmonic+noise model for speech synthesis
We present here an Harmonic+Noise Model (HNM) for speech synthesis. The noise part is represented by an autoregressive model whose output is pitchsynchronously modulated in energy. The harmonic part of the signal is represented by a sinusoidal model. This paper compares di erent methods for separating these two components. We then propose a method for the estimation of the sinusoidal parameters...
متن کامل